1,219 research outputs found

    Incorporating rich background knowledge for gene named entity classification and recognition

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene named entity classification and recognition are crucial preliminary steps of text mining in biomedical literature. Machine learning based methods have been used in this area with great success. In most state-of-the-art systems, elaborately designed lexical features, such as words, n-grams, and morphology patterns, have played a central part. However, this type of feature tends to cause extreme sparseness in feature space. As a result, out-of-vocabulary (OOV) terms in the training data are not modeled well due to lack of information.</p> <p>Results</p> <p>We propose a general framework for gene named entity representation, called feature coupling generalization (FCG). The basic idea is to generate higher level features using term frequency and co-occurrence information of highly indicative features in huge amount of unlabeled data. We examine its performance in a named entity classification task, which is designed to remove non-gene entries in a large dictionary derived from online resources. The results show that new features generated by FCG outperform lexical features by 5.97 F-score and 10.85 for OOV terms. Also in this framework each extension yields significant improvements and the sparse lexical features can be transformed into both a lower dimensional and more informative representation. A forward maximum match method based on the refined dictionary produces an F-score of 86.2 on BioCreative 2 GM test set. Then we combined the dictionary with a conditional random field (CRF) based gene mention tagger, achieving an F-score of 89.05, which improves the performance of the CRF-based tagger by 4.46 with little impact on the efficiency of the recognition system. A demo of the NER system is available at <url>http://202.118.75.18:8080/bioner</url>.</p

    A Measurement of Rb using a Double Tagging Method

    Get PDF
    The fraction of Z to bbbar events in hadronic Z decays has been measured by the OPAL experiment using the data collected at LEP between 1992 and 1995. The Z to bbbar decays were tagged using displaced secondary vertices, and high momentum electrons and muons. Systematic uncertainties were reduced by measuring the b-tagging efficiency using a double tagging technique. Efficiency correlations between opposite hemispheres of an event are small, and are well understood through comparisons between real and simulated data samples. A value of Rb = 0.2178 +- 0.0011 +- 0.0013 was obtained, where the first error is statistical and the second systematic. The uncertainty on Rc, the fraction of Z to ccbar events in hadronic Z decays, is not included in the errors. The dependence on Rc is Delta(Rb)/Rb = -0.056*Delta(Rc)/Rc where Delta(Rc) is the deviation of Rc from the value 0.172 predicted by the Standard Model. The result for Rb agrees with the value of 0.2155 +- 0.0003 predicted by the Standard Model.Comment: 42 pages, LaTeX, 14 eps figures included, submitted to European Physical Journal

    Measurement of the B+ and B-0 lifetimes and search for CP(T) violation using reconstructed secondary vertices

    Get PDF
    The lifetimes of the B+ and B-0 mesons, and their ratio, have been measured in the OPAL experiment using 2.4 million hadronic Z(0) decays recorded at LEP. Z(0) --> b (b) over bar decays were tagged using displaced secondary vertices and high momentum electrons and muons. The lifetimes were then measured using well-reconstructed charged and neutral secondary vertices selected in this tagged data sample. The results aretau(B+) = 1.643 +/- 0.037 +/- 0.025 pstau(Bo) = 1.523 +/- 0.057 +/- 0.053 pstau(B+)/tau(Bo) = 1.079 +/- 0.064 +/- 0.041,where in each case the first error is statistical and the second systematic.A larger data sample of 3.1 million hadronic Z(o) decays has been used to search for CP and CPT violating effects by comparison of inclusive b and (b) over bar hadron decays, No evidence fur such effects is seen. The CP violation parameter Re(epsilon(B)) is measured to be Re(epsilon(B)) = 0.001 +/- 0.014 +/- 0.003and the fractional difference between b and (b) over bar hadron lifetimes is measured to(Delta tau/tau)(b) = tau(b hadron) - tau((b) over bar hadron)/tau(average) = -0.001 +/- 0.012 +/- 0.008

    Measurement of the running of the QED coupling in small-angle Bhabha scattering at LEP

    Full text link
    Using the OPAL detector at LEP, the running of the effective QED coupling alpha(t) is measured for space-like momentum transfer from the angular distribution of small-angle Bhabha scattering. In an almost ideal QED framework, with very favourable experimental conditions, we obtain: Delta alpha(-6.07GeV^2) - Delta alpha(-1.81GeV^2) = (440 pm 58 pm 43 pm 30) X 10^-5, where the first error is statistical, the second is the experimental systematic and the third is the theoretical uncertainty. This agrees with current evaluations of alpha(t).The null hypothesis that alpha remains constant within the above interval of -t is excluded with a significance above 5sigma. Similarly, our results are inconsistent at the level of 3sigma with the hypothesis that only leptonic loops contribute to the running. This is currently the most significant direct measurment where the running alpha(t) is probed differentially within the measured t range.Comment: 43 pages, 12 figures, Submitted to Euro. Phys. J.

    Relevant microclimate for determining the development rate of malaria mosquitoes and possible implications of climate change

    Get PDF
    Background The relationship between mosquito development and temperature is one of the keys to understanding the current and future dynamics and distribution of vector-borne diseases such as malaria. Many process-based models use mean air temperature to estimate larval development times, and hence adult vector densities and/or malaria risk. Methods Water temperatures in three different-sized water pools, as well as the adjacent air temperature in lowland and highland sites in western Kenya were monitored. Both air and water temperatures were fed into a widely-applied temperature-dependent development model for Anopheles gambiae immatures, and subsequently their impact on predicted vector abundance was assessed. Results Mean water temperature in typical mosquito breeding sites was 4-6°C higher than the mean temperature of the adjacent air, resulting in larval development rates, and hence population growth rates, that are much higher than predicted based on air temperature. On the other hand, due to the non-linearities in the relationship between temperature and larval development rate, together with a marginal buffering in the increase in water temperature compared with air temperature, the relative increases in larval development rates predicted due to climate change are substantially less. Conclusions Existing models will tend to underestimate mosquito population growth under current conditions, and may overestimate relative increases in population growth under future climate change. These results highlight the need for better integration of biological and environmental information at the scale relevant to mosquito biology

    Serum Angiopoietin-1 and -2 Levels Discriminate Cerebral Malaria from Uncomplicated Malaria and Predict Clinical Outcome in African Children

    Get PDF
    BACKGROUND: Limited tools exist to identify which individuals infected with Plasmodium falciparum are at risk of developing serious complications such as cerebral malaria (CM). The objective of this study was to assess serum biomarkers that differentiate between CM and non-CM, with the long-term goal of developing a clinically informative prognostic test for severe malaria. METHODOLOGY/PRINCIPAL FINDINGS: Based on the hypothesis that endothelial activation and blood-brain-barrier dysfunction contribute to CM pathogenesis, we examined the endothelial regulators, angiopoietin-1 (ANG-1) and angiopoietin-2 (ANG-2), in serum samples from P. falciparum-infected patients with uncomplicated malaria (UM) or CM, from two diverse populations--Thai adults and Ugandan children. Angiopoietin levels were compared to tumour necrosis factor (TNF). In both populations, ANG-1 levels were significantly decreased and ANG-2 levels were significantly increased in CM versus UM and healthy controls (p<0.001). TNF was significantly elevated in CM in the Thai adult population (p<0.001), but did not discriminate well between CM and UM in African children. Receiver operating characteristic curve analysis showed that ANG-1 and the ratio of ANG-2:ANG-1 accurately discriminated CM patients from UM in both populations. Applied as a diagnostic test, ANG-1 had a sensitivity and specificity of 100% for distinguishing CM from UM in Thai adults and 70% and 75%, respectively, for Ugandan children. Across both populations the likelihood ratio of CM given a positive test (ANG-1<15 ng/mL) was 4.1 (2.7-6.5) and the likelihood ratio of CM given a negative test was 0.29 (0.20-0.42). Moreover, low ANG-1 levels at presentation predicted subsequent mortality in children with CM (p = 0.027). CONCLUSIONS/SIGNIFICANCE: ANG-1 and the ANG-2/1 ratio are promising clinically informative biomarkers for CM. Additional studies should address their utility as prognostic biomarkers and potential therapeutic targets in severe malaria

    A search for the decay modes B+/- to h+/- tau l

    Get PDF
    We present a search for the lepton flavor violating decay modes B+/- to h+/- tau l (h= K,pi; l= e,mu) using the BaBar data sample, which corresponds to 472 million BBbar pairs. The search uses events where one B meson is fully reconstructed in one of several hadronic final states. Using the momenta of the reconstructed B, h, and l candidates, we are able to fully determine the tau four-momentum. The resulting tau candidate mass is our main discriminant against combinatorial background. We see no evidence for B+/- to h+/- tau l decays and set a 90% confidence level upper limit on each branching fraction at the level of a few times 10^-5.Comment: 15 pages, 7 figures, submitted to Phys. Rev.

    Evidence for an excess of B -> D(*) Tau Nu decays

    Get PDF
    Based on the full BaBar data sample, we report improved measurements of the ratios R(D(*)) = B(B -> D(*) Tau Nu)/B(B -> D(*) l Nu), where l is either e or mu. These ratios are sensitive to new physics contributions in the form of a charged Higgs boson. We measure R(D) = 0.440 +- 0.058 +- 0.042 and R(D*) = 0.332 +- 0.024 +- 0.018, which exceed the Standard Model expectations by 2.0 sigma and 2.7 sigma, respectively. Taken together, our results disagree with these expectations at the 3.4 sigma level. This excess cannot be explained by a charged Higgs boson in the type II two-Higgs-doublet model. We also report the observation of the decay B -> D Tau Nu, with a significance of 6.8 sigma.Comment: Expanded section on systematics, text corrections, improved the format of Figure 2 and included the effect of the change of the Tau polarization due to the charged Higg

    Search for rare quark-annihilation decays, B --> Ds(*) Phi

    Full text link
    We report on searches for B- --> Ds- Phi and B- --> Ds*- Phi. In the context of the Standard Model, these decays are expected to be highly suppressed since they proceed through annihilation of the b and u-bar quarks in the B- meson. Our results are based on 234 million Upsilon(4S) --> B Bbar decays collected with the BABAR detector at SLAC. We find no evidence for these decays, and we set Bayesian 90% confidence level upper limits on the branching fractions BF(B- --> Ds- Phi) Ds*- Phi)<1.2x10^(-5). These results are consistent with Standard Model expectations.Comment: 8 pages, 3 postscript figues, submitted to Phys. Rev. D (Rapid Communications
    corecore